CDS

Accession Number TCMCG075C10082
gbkey CDS
Protein Id XP_017972932.1
Location join(28672029..28672105,28672195..28672246,28672332..28672404,28672499..28672572,28672653..28672712,28672799..28672964,28673396..28673495,28674271..28674353,28674494..28674594,28675085..28675215,28675648..28676149)
Gene LOC18605548
GeneID 18605548
Organism Theobroma cacao

Protein

Length 472aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA341501
db_source XM_018117443.1
Definition PREDICTED: flap endonuclease GEN-like 2 isoform X3 [Theobroma cacao]

EGGNOG-MAPPER Annotation

COG_category L
Description Flap endonuclease GEN-like
KEGG_TC -
KEGG_Module -
KEGG_Reaction -
KEGG_rclass -
BRITE ko00000        [VIEW IN KEGG]
ko01000        [VIEW IN KEGG]
ko03400        [VIEW IN KEGG]
KEGG_ko ko:K15338        [VIEW IN KEGG]
EC -
KEGG_Pathway -
GOs GO:0003674        [VIEW IN EMBL-EBI]
GO:0003824        [VIEW IN EMBL-EBI]
GO:0004518        [VIEW IN EMBL-EBI]
GO:0004519        [VIEW IN EMBL-EBI]
GO:0004520        [VIEW IN EMBL-EBI]
GO:0004536        [VIEW IN EMBL-EBI]
GO:0006139        [VIEW IN EMBL-EBI]
GO:0006259        [VIEW IN EMBL-EBI]
GO:0006725        [VIEW IN EMBL-EBI]
GO:0006807        [VIEW IN EMBL-EBI]
GO:0008150        [VIEW IN EMBL-EBI]
GO:0008152        [VIEW IN EMBL-EBI]
GO:0008821        [VIEW IN EMBL-EBI]
GO:0009987        [VIEW IN EMBL-EBI]
GO:0016787        [VIEW IN EMBL-EBI]
GO:0016788        [VIEW IN EMBL-EBI]
GO:0016889        [VIEW IN EMBL-EBI]
GO:0016894        [VIEW IN EMBL-EBI]
GO:0034641        [VIEW IN EMBL-EBI]
GO:0043170        [VIEW IN EMBL-EBI]
GO:0044237        [VIEW IN EMBL-EBI]
GO:0044238        [VIEW IN EMBL-EBI]
GO:0044260        [VIEW IN EMBL-EBI]
GO:0046483        [VIEW IN EMBL-EBI]
GO:0071704        [VIEW IN EMBL-EBI]
GO:0090304        [VIEW IN EMBL-EBI]
GO:0090305        [VIEW IN EMBL-EBI]
GO:0140097        [VIEW IN EMBL-EBI]
GO:1901360        [VIEW IN EMBL-EBI]

Sequence

CDS:  
ATGGGCTCTGAATTTTCATGCATGATCAAAGAAGCAAAAGTCCTTGGATTAGCACTTGGGATTACGTGCTTAGATGGGATTGAGGAAGCTGAAGCACAATGTGCATTATTAAACATAGAATCCTTATGTGATGGGTGTTTCTCTTCTGATTCAGATATCTTTCTTTTCGGCGCAAGAACAGTGTACAGAGACATTTGCCTTGGGGAAGGAGGTCATGTTGTTTGTTATGAAATGGCTGATATAGAGCAAAAACTTGGATTTGGAAGGAACTCCTTGATTTCTCTGGCCCTTCTTCTTGGCAGTGATTACTCTCAGGGTGTTCATGGTCTGGGTCCGGGGTCAGCATGCCAGCTTGTTAAATCAGTTGGAGACCATGATATTCTTCAAAAAGTTGCATCAGAAGGACTGTCTTTTGCGAGGAACACAAAAAGTTCAAGGAAACAGGGTCAAGACAAGTGCAATGACAAGACAACCACATTGCATCATGAAGTGAGCATGAATGGAAGCAATCATAATTTACAAAGAGATAATCAGTATTTGCAAGTGGTGGAAGCATATATGAAGCCCAAGTGCCACTCAGCAGATTCTGATGTAGTCAATAGGGTTCTTGTGCAGCATCCATTTCAGCGCGAGCTACTTCAACAGCTATGTGCTCAGTACTTTGAGTGGCCTCCTGAGAAAACAGATGAATACATCCTTCCTAAGATTGCTGAAAGAGATTTACGACGGTTTGCCAAGTTGCGGTCAGCTTCGTCTCAATTGGGTGTTAACATTCCATTGAAGGAGATACCAGTCAAGTGTCCTGTATCAGTAATTATTAAGCACCGAAAAGTTCATGGAGAAGAATGTTTCGAGGTGTCGTGGGAAGAGCTAGATGGAATCAAAACCTCTGTAATTTCAGCAGATCTCATAAAGAGTGCTTGTCCTGAAAAGATCACTGAGTTTGTTGACAGAAGAGCTCTAGAGAAGAAACATCACCGTAAATCAAGACCAAAGAAATCAGAACAAAAATGTTCTGTGGCAGAAATAGATCTGAAACTCCAAGATCTGTTGCTTGACATCGAGTTAGGAAGCAAGTCCATTCCCATTGCTTCAAGAGAAGTTATATCAGGCAAAATGACCATGGCAACTGAGGGTAATTTCGTAAACCTAGATCCTGAGGTTATTTTGGAGTCAGAAGGCAATGCTGATTGCAAAGCTGTAAGGTTATGCCCACAAACTGGTATGACTGCTCCAAAGCATGAAGTTATTGATCTTTCGAGCCCCTCTCCGCAAGTGCAGTCCCAGAATGTTCCCAGATGCACTGACGTTAGTGTAATTGATTTGAGTGACTCAGAAACTGAGAGGTCACCTGAACATGTGAAGAAAGCAAGAGAGCTTAGATTGTTTCTGGCCAGTATTCGAGATGACATTCATTGA
Protein:  
MGSEFSCMIKEAKVLGLALGITCLDGIEEAEAQCALLNIESLCDGCFSSDSDIFLFGARTVYRDICLGEGGHVVCYEMADIEQKLGFGRNSLISLALLLGSDYSQGVHGLGPGSACQLVKSVGDHDILQKVASEGLSFARNTKSSRKQGQDKCNDKTTTLHHEVSMNGSNHNLQRDNQYLQVVEAYMKPKCHSADSDVVNRVLVQHPFQRELLQQLCAQYFEWPPEKTDEYILPKIAERDLRRFAKLRSASSQLGVNIPLKEIPVKCPVSVIIKHRKVHGEECFEVSWEELDGIKTSVISADLIKSACPEKITEFVDRRALEKKHHRKSRPKKSEQKCSVAEIDLKLQDLLLDIELGSKSIPIASREVISGKMTMATEGNFVNLDPEVILESEGNADCKAVRLCPQTGMTAPKHEVIDLSSPSPQVQSQNVPRCTDVSVIDLSDSETERSPEHVKKARELRLFLASIRDDIH